Improvement of the Speech Recognition under the Noisy Environment using the Wavelet Transform

نویسندگان

Yoichi MIDORIKAWA

Masanori AKITA

چکیده

Signal pattern recognition is one of the important technologies in this century. Speech recognition is very important for human interface with computer and machine. However, these speech recognition methods have a weak point, they work best in a noiseless condition. Automatic speech recognition systems are most effective in noiseless environments. If the data are polluted with noise, these speech recognitions are extremely difficult. For the noise reduction of signals, there are filters and spectral subtraction method, and so on. However, there is some limitation in case that the quality of the signal is poor. We propose a method for automatic speech recognition with signals contaminated with colored noise using modification of the spectral envelope shape. The proposed method is based on cepstral analysis. We have proposed the modified rules for adding valleys and recovering valleys. There is plenty of scope for improvement. In this paper, we apply the wavelet transform to the modification of spectral envelope shape. The wavelet transform is used for the extraction of particular features in the frequency and time domains. The wavelet transform is widely used for wave and image analysis. We apply a wavelet transform to speech recognition under noisy environments using cepstral analysis. As a result, speech recognition rate is improved. And data is compressed by the wavelet transform. It was shown that the wavelet analysis was one of the promising methodologies for the pattern recognition noisy signal.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain

Quality of speech signal significantly reduces in the presence of environmental noise signals and leads to the imperfect performance of hearing aid devices, automatic speech recognition systems, and mobile phones. In this paper, the single channel speech enhancement of the corrupted signals by the additive noise signals is considered. A dictionary-based algorithm is proposed to train the speech...

متن کامل

Automatic Speech Recognition In Noisy Environments Using Wavelet Transform

The performance of speech recognition systems is mainly determined by the used acoustic feature extraction technique. Two techniques are known, namely the full-band approach and the multi-band approach using filter banks. Systems using either approach usually suffer from performance degradation in the presence of noise. In this paper, the multi-band approach using Wavelet transform is suggested...

متن کامل

Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions

Automatic recognition of speech emotional states in noisy conditions has become an important research topic in the emotional speech recognition area, in recent years. This paper considers the recognition of emotional states via speech in real environments. For this task, we employ the power normalized cepstral coefficients (PNCC) in a speech emotion recognition system. We investigate its perfor...

متن کامل

Speech Emotion Recognition Using Scalogram Based Deep Structure

Speech Emotion Recognition (SER) is an important part of speech-based Human-Computer Interface (HCI) applications. Previous SER methods rely on the extraction of features and training an appropriate classifier. However, most of those features can be affected by emotionally irrelevant factors such as gender, speaking styles and environment. Here, an SER method has been proposed based on a concat...

متن کامل

روشی جدید در بازشناسی مقاوم گفتار مبتنی بر دادگان مفقود با استفاده از شبکه عصبی دوسویه

Performance of speech recognition systems is greatly reduced when speech corrupted by noise. One common method for robust speech recognition systems is missing feature methods. In this way, the components in time - frequency representation of signal (Spectrogram) that present low signal to noise ratio (SNR), are tagged as missing and deleted then replaced by remained components and statistical ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2004

Improvement of the Speech Recognition under the Noisy Environment using the Wavelet Transform

نویسندگان

چکیده

منابع مشابه

A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain

Automatic Speech Recognition In Noisy Environments Using Wavelet Transform

Speech Emotion Recognition Based on Power Normalized Cepstral Coefficients in Noisy Conditions

Speech Emotion Recognition Using Scalogram Based Deep Structure

روشی جدید در بازشناسی مقاوم گفتار مبتنی بر دادگان مفقود با استفاده از شبکه عصبی دوسویه

عنوان ژورنال:

اشتراک گذاری